Fix NameError in HuggingFaceGenerationAdapter.prepare_inputs_for_generation by whn09 · Pull Request #136 · aws-neuron/neuronx-distributed-inference

whn09 · 2026-04-22T03:42:44Z

Description

HuggingFaceGenerationAdapter.prepare_inputs_for_generation builds model_inputs with

"tensor_capture_hook": tensor_capture_hook,

but never extracts tensor_capture_hook from **kwargs, while input_capture_hook is extracted on the line immediately above it. Any call path that reaches this method — including adapter.generate() on a model whose forward() accepts a tensor_capture_hook kwarg (e.g. NeuronBaseForImageToText subclasses) — raises NameError: name 'tensor_capture_hook' is not defined.

This PR adds the missing one-line extraction, mirroring how input_capture_hook is handled.

Reproduction

Any call to adapter.generate(...) where the underlying model declares tensor_capture_hook in its forward signature triggers the bug. Minimal example:

from neuronx_distributed_inference.utils.hf_adapter import HuggingFaceGenerationAdapter
# adapter wraps any NeuronBaseForImageToText-based model (e.g. Qwen2-VL, Qwen3-VL)
adapter.generate(input_ids=..., attention_mask=..., max_new_tokens=8)
# -> NameError: name 'tensor_capture_hook' is not defined

Fix

src/neuronx_distributed_inference/utils/hf_adapter.py:

         input_capture_hook = kwargs.get("input_capture_hook", None)
+        tensor_capture_hook = kwargs.get("tensor_capture_hook", None)

Behavior for callers that do not pass tensor_capture_hook is unchanged (it stays None, which is what model_inputs["tensor_capture_hook"] = None already means in the existing code).

Testing

Manually verified that adapter.generate() on an image-to-text model no longer raises NameError.
No change to the public API surface; the only change is that a previously undefined local is now extracted from **kwargs with a None default, matching the pattern already used for input_capture_hook and accepted_indices.

Compatibility

Neuron SDK Version(s): 2.27.x (where the bug was introduced) and newer
Instance Type(s): Any
PyTorch Version: Any supported
Python Version: Any supported

…ration prepare_inputs_for_generation builds model_inputs with `"tensor_capture_hook": tensor_capture_hook` but never extracts tensor_capture_hook from **kwargs (while input_capture_hook is extracted in the line directly above it). Any call path that reaches this method — including adapter.generate() — raises NameError. The fix is to extract tensor_capture_hook from kwargs, mirroring how input_capture_hook is handled. No other change is needed because NeuronBaseForImageToText.forward already accepts tensor_capture_hook as a keyword argument. Co-Authored-By: Claude Opus 4.7 <noreply@anthropic.com>

aws-mgld · 2026-05-04T19:23:26Z

Hi @whn09 - thanks for the fix!

This has already landed on main via commit ef2731e. As a workaround until a future release, can you use the functionality in main? Please let us know if you encounter other issues - otherwise we'll go ahead and close this PR. Thanks again for the detailed repro!

whn09 mentioned this pull request Apr 22, 2026

[contrib] Add Qwen2.5-Omni-7B with full Neuron speech pipeline (Thinker+Talker+Token2Wav, TP=4) #135

Open

17 tasks

aws-mgld closed this May 4, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix NameError in HuggingFaceGenerationAdapter.prepare_inputs_for_generation#136

Fix NameError in HuggingFaceGenerationAdapter.prepare_inputs_for_generation#136
whn09 wants to merge 1 commit intoaws-neuron:mainfrom
whn09:fix/hf-adapter-tensor-capture-hook-nameerror

whn09 commented Apr 22, 2026

Uh oh!

aws-mgld commented May 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

whn09 commented Apr 22, 2026

Description

Reproduction

Fix

Testing

Compatibility

Related

Uh oh!

aws-mgld commented May 4, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants